Audio-video control system

ABSTRACT

An audio-video control system has a first AV displayer, a second AV displayer, a router and a processor. The first AV displayer is located in a first area and communicatively connected to a first device, and receives a first audio signal in the first area. The second AV displayer is located in a second area and communicatively connected to a second device, and receives a second audio signal in the second area. The router is communicatively connected to the first AV displayer and the second AV displayer. The processor processes the first audio signal to generate a first control signal and processes the second audio signal to generate a second control signal. The processor sends the first control signal to the first device via the router and the first AV displayer and sends the second control signal to the second device via the router and the second AV displayer.

CROSS-REFERENCE TO RELATED APPLICATIONS

This non-provisional application claims priority under 35 U.S.C. §119(a) on Patent Application No(s). 106146360 filed in Taiwan R.O.C. on29 Dec. 2017 the entire contents of which are hereby incorporated byreference.

BACKGROUND 1. Technical Field

The disclosure relates to an audio-video control system, moreparticularly to the audio-video control system with a function ofinternet of things.

2. Related Art

Nowadays, most of the present smart home appliance manufacturers providea smart home appliance control host. Although a plurality of the smarthome appliances may be controlled by the smart home appliance controlhost through the internet of things, the user often needs to installmany extra sensors such as the audio and video sensors. As a result, theextra installation not only costs the user to pay more, but also affectsthe layout of the home environment.

Additionally, when the conventional sensors are installed, thecorresponding parameters have to be set particularly so as to make thesmart home appliance control host be able to recognize where the sensorsare located, so as to provide the correct control to the appliance.However, the setting is more complex as there are more sensors.

SUMMARY

According to one or more embodiment of this disclosure, an audio-videocontrol system is adapted for controlling a first device in a first areaand a second device in a second area. The system comprising a firstaudio-video player, a second audio-video player, a router and aprocessor. The first audio-video player is disposed in the first areaand communicatively connected to the first device. The first audio-videoplayer comprises a first audio sensor for receiving a first audio signalin the first area. The second audio-video player is disposed in thesecond area and communicatively connected to the second device. Thesecond audio-video player comprises a second audio sensor for receivinga second audio signal in the second area. The router is communicativelyconnected to the first audio-video player and the second audio-videoplayer. The processor is communicatively connected to the firstaudio-video player and the second audio-video player through the router.The processor is adapted for processing the first audio signal for togenerating a first control signal, and for the processor processing thesecond audio signal to for generating a second control signal. Theprocessor sends the first control signal to the first device through therouter and the first audio-video player. The processor sends the secondcontrol signal to the second device through the router and the secondaudio-video player.

BRIEF DESCRIPTION OF THE DRAWINGS

The present disclosure will become more fully understood from thedetailed description given hereinbelow and the accompanying drawingswhich are given by way of illustration only and thus are not limitativeof the present disclosure and wherein:

FIG. 1 is the functional block diagram of the audio-video control systemaccording to one embodiment in this disclosure.

FIG. 2 is the functional block diagram of the audio-video control systemaccording to one embodiment in this disclosure.

FIG. 3 is the schematic diagram of the noise suppression processaccording to one embodiment in this disclosure.

DETAILED DESCRIPTION

In the following detailed description, for purposes of explanation,numerous specific details are set forth in order to provide a thoroughunderstanding of the disclosed embodiments. It will be apparent,however, that one or more embodiments may be practiced without thesespecific details. In other instances, well-known structures and devicesare schematically shown in order to simplify the drawings.

Please refer to FIG. 1, which is the functional block diagram of theaudio-video control system according to one embodiment in thisdisclosure. As FIG. 1 shows, the audio-video control system 1000 in oneembodiment of this disclosure comprises a first audio-video player 1100,a second audio-video player 1200, a processor 1700, a network bridge1800 and a router 1900.

For example, the first audio-video player 1100 and the secondaudio-video player 1200 may be a television. The first audio-videoplayer 1100 is communicatively connected to the first device 3000, andthe second audio-video player 1200 is communicatively connected to thesecond device 5000. In addition, the first audio-video player 1100 maybe disposed in the first area such as the master bedroom, and the secondaudio-video player 1200 may be disposed in the second area such as theliving room.

The first audio-video player 1100 may comprise a router/hub/access point(AP) (not shown in the figure), so that the first device 3000 may usethe first audio-video player 1100 as a base station or a router. And thefirst device 3000 is communicatively connected to the router 1900through the first audio-video player 1100 or the network componentthereof. Hence, the communication connection between the firstaudio-video player 1100 and the processor 1700 is implemented.

Similar to the first audio-video player 1100, the second audio-videoplayer 1200 may also comprise a router/hub/access point (AP) (not shownin the figure), so that the second device 5000 may use the secondaudio-video player 1200 as the base station or the router. And thesecond device 5000 is communicatively connected to the router 1900through the second audio-video player 1200 or the network componentthereof. Therefore, the communication connection between the secondaudio-video player 1200 and the processor 1700 is implemented.

The description hereinafter is an example of the first audio-videoplayer 1100, and since the second audio-video player 1200 has thesimilar structure and function, the corresponding descriptions are notillustrated herein. The first audio-video player 1100 comprises a firstaudio sensor 1110, a first video player 1120, a first audio player 1130and a network component 1140. The first audio sensor 1110 is adapted forreceiving a first audio signal in a first area. The first video player1120 and the first audio player 1130 are able to play audio signals andvideo signals respectively at the same time, and the function such asthe audio-video player of the television may be implemented. Also, thenetwork component 1140 is used as a router to communicatively connect tothe first device 3000. Specifically, for example, the first audio sensor1110 may have a microphone and an analog digital converter (not shown inthe figures). The microphone is able to receive the first audio signalin the audio form from the first area, and the microphone transforms thefirst audio signal in the audio form to the first audio signal in theanalog form. In addition, the analog digital converter is able totransform the first audio signal in the analog form to the first audiosignal in the digital form

The processor 1700 is connected to the network through the networkbridge 1800, and is electrically connected to the first audio-videoplayer 1100 and the second audio-video player 1200 through the router1900. Specifically, the processor 1700 controls the first device 3000 inthe LAN (local area network) through the first audio-video player 1100,and controls the second device 5000 in the LAN (local area network)through the second audio-video player 1200.

For example, when the first audio sensor 1110 of the first audio-videoplayer 1100 receives the first audio signal in the first area, theprocessor 1700 is able to analyze this first audio signal. Then theprocessor 1700 determines or recognizes whether the source of this firstaudio signal is corresponding to a first account according to theanalyzed result. When the source of the first audio signal iscorresponding to the first account, the processor 1700 is able toprocess the first audio signal for obtaining the corresponding controlsignal, and the processor 1700 controls the first device 3000 by thecorresponding control signal so as to make the first device 3000 performthe corresponding function. Aforementioned source of the first audiosignal is associated with the object generating the first audio signal.Specifically, the processor 1700 analyzes the characteristic of thefirst audio signal and then compares with the acoustic fingerprinting ofthe first account. The processor 1700 processes the first audio signalfor obtaining the control signal when the processor 1700 determines thatthe characteristic of the first audio signal is matched with theacoustic fingerprinting of the first account. In an embodiment, when theprocessor 1700 determines that the characteristic of the first audiosignal is matched with the acoustic fingerprinting of the first accountand that the first audio signal is adapted for turning on or turning offthe first device 3000, the processor 1700 is able to turn on or turn offthe first device 3000 correspondingly. In another embodiment, when theprocessor 1700 determines that the characteristic of the first audiosignal is matched with the acoustic fingerprinting of the first accountand that the first audio signal is adapted for controlling the firstdevice 3000 to perform a specific function, the processor 1700 is ableto transmit the control signal correspondingly in order to make thefirst device 3000 perform abovementioned specific function. Furthermore,when the first device 3000 is an illuminating device with the functionof changeable brightness, abovementioned specific function may be thebrightness change of this illuminating device. Moreover, when the firstdevice 3000 is an air condition device with the function of severalchangeable parameters, abovementioned specific function may be thetemperature, the air volume or the shutdown time, etc.

In an embodiment, the user is able to set the audio-video control system1000 in a training mode through the first audio-video player 1100. Inthe training mode, the processor 1700 is able to compare thecharacteristic of the first audio signal with the acousticfingerprinting of the first account. When the processor 1700 determinesthat the characteristic of the first audio signal is matched withacoustic fingerprinting of the first account, the processor 1700 mayrecord the operations on the first device 3000 by the source of thefirst audio signal in a following period of time. Therefore, the firstinstruction set of the first account and the first area is built inabovementioned way. In the training mode, when the processor 1700determines that the characteristic of the first audio signal is notmatched with the acoustic fingerprinting of the first account, theprocessor 1700 transmits a prompt notice to add a new account for thesource of first audio signal.

Similarly, when the user sets the audio-video control system 1000 as thetraining mode through the second audio-video player 1200, when theprocessor 1700 determines that the characteristic of the second audiosignal is matched with the acoustic fingerprinting of the first accountin the second area, the processor 1700 may record the operation theoperations on the second device 5000 by the user in a following periodof time so as to build the corresponding second instruction set for thefirst account in the second area. On the other hand, when the processor1700 determines that the characteristic of the second audio signal inthe second area is not matched with the acoustic fingerprinting of thesecond account, the processor 1700 transmits a prompt notice to add anew account for the source of the second audio signal. Hence, the useris able to build N accounts corresponding to M areas, and there are N*Minstruction sets totally, wherein N and M are positive integers.

In another embodiment, the processor 1700 further stores the timeinformation in a storage medium HDD. Also, when the processor 1700processes the first audio signal according to the first instruction setfor obtaining the control signal, the processor 1700 is further able toconsult with the time information. That is, the processor 1700 in thisdisclosure not only generates aforementioned control instructionaccording to the first instruction set, but also generates the controlinstruction according to the time information. As a result, the controlinstruction may accord with the corresponding habit or the likes. In anembodiment, aforementioned time information may send the first audiosignal in a period during a predetermined period (such as a week or amonth) so as to make the first device 3000 perform the frequency ofspecific function. For example, the same user often turns off the aircondition and turns on the light in the master bedroom (first area) inthe morning, and often turns on the air condition and turns off thelight in the master bedroom at the night. Hence, the first instructionset is divided into a first sub-instruction set (corresponding tomorning) and a second sub-instruction set (corresponding to night), andit may speed up the analysis and feedback time of the instruction.Although in aforementioned embodiment, the electronic device connectedto the first audio-video player only includes the first device 3000 inthe first area, this disclosure is not limited by this feature. FIG. 2is the functional block diagram of the audio-video control systemaccording to one embodiment in this disclosure. Comparison with FIG. 1,the main difference is that there is a third device 4000 connected tothe first audio-video player 1100 through the network component 1140 inthe first area. The processor 1700 is able to send corresponding controlsignal according to the first audio signal so as to control the thirddevice 4000. Furthermore, there is a fourth device 6000 in the firstarea in FIG. 2, and since other parts are similar to the audio-videocontrol system 1000 in FIG. 1, the corresponding descriptions are notillustrated again herein. Specifically, a plurality of electronicdevices is able to communicatively connect to the processor 1700 throughthe first audio-video player 1100. The processor 1700 is able to analyzethe first audio signal, and is able to perform corresponding operationwith aforementioned electronic devices according to the analyticalresult.

Please refer to FIG. 1, the first instruction set of the first accountmay be saved in the cloud server 2000. By the network bridge 1800, theprocessor 1700 communicates to the cloud server 2000 through theinternet. The processor 1700 is able to send the first audio signal tothe cloud server 2000, and the cloud server 2000 processes the programsuch as the analysis and processing the first audio signal, ect. Hence,the processor 1700 generates the control instruction according to theresults of the analysis and processing, and the processor 1700 furtherobtains the control signal from the cloud server 2000 by the networkbridge 1800. Specifically, the first instruction set is saved in thelocal storage medium HDD in above embodiment, so necessary time forobtaining the control signal by processing the first audio signal couldbe shorter. However, the complexity and/or the amount of theinstructions are limited by the computing power of the processor 1700and the storage capacity of the storage medium HDD. In this embodiment,the first instruction set is saved in the cloud server 2000. In otherwords, the program of analysis and processing the first audio signal forobtaining the control signal is performed by the cloud server 2000.Therefore, a more complicated method “the artificial intelligence systembased on the neural network architecture” (performed in the cloudserver) may be performed, and more complex and more amount of theinstructions may be processed. However, the time spent for obtaining thecontrol signal by processing the first audio signal is longer than aboveembodiment.

In another embodiment, the user may watch the television and send theinstruction by voice in the first area (in the master bedroom). In thiscondition, the voice from the first audio-video player 1100 may be hardto be processed for the first audio signal.

In order to avoid the interference made by the voice from the firstaudio-video player 1100, and aforementioned interference may affect therecording quality of the first audio sensor 1100. As a result, inanother embodiment in this disclosure, the processor 1700 is able toobtain the audio signal playing currently (obtaining from the firstaudio-video player 1100 directly, rather than obtaining from therecording of the first audio sensor 1110) from the first audio-videoplayer 1100. Also, the processor 1700 performs the noise suppressionprocess for the first audio signal according the audio playing signalrecorded by the first audio sensor 1110, and obtains the third audiosignal. Then, the processor 1700 processes the third audio signal so asto obtain the control signal, wherein the processor 1700 obtains theaudio playing signal in the electronic signal form the first audio-videoplayer 1100 through the LAN (local area network) directly.

Specifically, please refer to FIG. 3, and FIG. 3 is the schematicdiagram of the noise suppression process according to one embodiment inthis disclosure. In FIG. 3, the first audio signal V1 is the audioplaying signal received by the first audio sensor 1110. The audio signalmay include the voice generated from the source of the first audiosignal (such as corresponding user of aforementioned first account), andthe audio signal actually played by the first audio-video player 1100.Also, the audio signal V2 is the audio playing signal (for example, theaudio stream from the audio and video channel system) obtained from thefirst audio-video player 1100 directly by the processor 1700, ratherthan the processed signal recorded by the first audio sensor 1110. AsFIG. 3 shows, the third audio signal V3 is obtained by deducting theaudio playing signal V2 from the first audio signal V1. Since the audioplaying signal V2 is sourced from the first audio-video player 1100,through removing the voice played by the first audio-video player 1100from the first audio signal V1 directly by the processor, the remainsignal should be the voice from the user in the first area.

Sine there is a time difference from the first audio player 1130 of thefirst audio-video player 1100 playing the voice to the first audiosensor 1110 receiving the voice, in order to cancel the interferencemade by the voice played by the first audio-video player 1100accurately, in an embodiment, the processor 1700 is able to obtain theaudio test signal which the first audio-video player 1100 want to playin the training mode directly, and the audio test signal actually playedby the first audio-video player 1100 is obtained by the first audiosensor 1110. Then, after the processor 1700 analyzes and processes thedirectly obtained audio test signal, and receives the audio test signal,a delay time difference T of two aforementioned audio test signals maybe obtained. Therefore, the third audio signal without the delay timedifference T is obtained by deducting the audio playing signal withoutthe delay from the first audio signal.

Additionally, the audio signal played by the first audio-video player1100 often includes the audio information and loudness information, andthe processor 1700 is not able to know the relation between the actualloudness information and the actual signal strength of the audio signalreceived by the first audio sensor 1110; also, the loudness is differentafter the voice actually played by the first audio-video player 1100 isreceived by the first audio sensor 1110. For these reasons, in anotherembodiment, the processor 1700 further builds a loudness adjustmentmodel according to the loudness information of the direct obtained audiotest signal and the audio test signal received by the first audio sensor1110. When the processor 1700 performs the noise suppression process,the processor 1700 calibrates the first audio signal obtained by thefirst audio sensor 1110 based on the loudness adjustment model at first,then the processor 1700 deducts the calibrated audio playing signal fromthe first audio signal so as to further obtain the third audio signal.Aforementioned embodiments may be implemented respectively or beimplemented together with each other, it is not limited in thisdisclosure. Furthermore, aforementioned embodiments may be implementedfor the second audio-video player 1200 in the same method.

Specifically, in the training mode, the processor 1700 controls thefirst audio-video player 1100 to use the audio test signal output fromthe first audio player 1130, wherein the audio test signal may be avoice with a volume setting of 20. The processor 1700 may obtain thecorresponding audio correction signal (for example, 50 dB) from thefirst audio sensor 1110. Also, the processor 1700 controls the firstaudio-video player 1100 to output the audio test signal, wherein theaudio test signal may be the voice with a volume setting of 15. Theprocessor 1700 may obtain the corresponding audio correction signal (forexample, 40 dB) from the first audio sensor 1110. Repeating above stepsmany times, by a plurality of audio test signals and a plurality ofcorresponding audio correction signals, the processor 1700 builds aloudness adjustment model for describing the relation between the audioplaying signal (or the audio test signal in the training mode) outputfrom the first audio player 1130 of the first audio-video player 1100,and the corresponding components in the first audio signal received fromthe first audio sensor 1110.

In an embodiment, when the processor 1700 receives the first audiosignal and the processor 1700 is not in the training mode, the processor1700 confirms whether the first audio-video player 1100 plays a programcurrently. When the first audio-video player 1100 plays the programcurrently, the processor 1700 obtains the current audio playing signalfrom the first audio-video player 1100, and after the processor 1700processes the audio playing signal by the loudness adjustment model forobtaining the corresponding waveforms and loudness in the first audiosignal, and according to the time difference obtained previously also,the processor 1700 removes the components corresponding to the audioplaying signal from the first audio signal for obtaining the secondaudio signal . In other words, in this embodiment, the audio playingsignal processed through the loudness adjustment model is removed fromthe first audio signal.

In another embodiment, when the processor 1700 receives the first audiosignal and the processor 1700 is not in the training mode, the processor1700 confirms whether the first audio-video player 1100 plays a programcurrently. When the first audio-video player 1100 plays the programcurrently, based on the relation described by the loudness adjustmentmodel, the processor 1700 transforms the information of the first audiosignal to the same form of the audio playing signal. Also, according tothe obtained time difference previously, the processor 1700 removes theaudio playing signal from the first audio signal which has beenprocessed by the loudness calibration for obtaining the second audiosignal. In other words, in this embodiment, it processes the first audiosignal by the loudness adjustment model in order to remove the audioplaying signal.

In some conditions, the area where the audio-video control system 1000is located, there may be a plurality of people there at the same time.Hence, in an embodiment, when the first audio sensor 1110 receives thefirst audio signal and a third audio signal simultaneously, theprocessor 1700 further determines whether the third audio signal iscorresponding to a second account. When the third audio signal is notcorresponding to the second account, the processor 1700 only processesthe first audio signal for obtaining the control signal. In other words,when the user used the audio-video control system 1000 has a visitor athome, since the visitor does not have an account set in the audio-videocontrol system 1000, the voice from the visitor is ignored.

In an embodiment in this disclosure, the storage medium HDD not onlysaves a plurality of instruction set associated with the first account(named the first account instruction set hereinafter), but also savesthe second account instruction set; wherein the first instruction set isassociated with the first account, and the second instruction set isassociated with the second account. When the processor 1700 determinesthe third audio signal is corresponding to the second account, theprocessor 1700 determines whether the first account is prior to thesecond account so as to determine to use the first account instructionset or the second account instruction set to generate the controlsignal. Specifically, if there are a father (the first account), amother (the second account), a sister (the third account) and a brother(the fourth account) in a family, wherein the priority is the secondaccount, the first account, the third account and the fourth account inorder. Among the people in the family, when they watch the television,the father often prefers the common light, the mother often prefers thelight with warm color, the sister often prefers to turn off all thelights and the brother often prefers to turn on the projection lampabove the television (the first audio-video player 1100). If the fatherand the mother send the instruction for switching the channel to themovie channel in the master bedroom (the first area) at the same time,the processor 1700 not only controls the first audio-video player 1100to switch the channel to the movie channel, but also controls the thirddevice 4000 (light) to switch to the light with warm color. Similarly,if the sister and the brother send the instruction for switching thechannel to the movie channel in the living room (the second area) at thesame time, the processor 1700 not only controls the second audio-videoplayer 1200 to switch the channel to the movie channel, but alsocontrols the fourth device 6000 (light) to turn off all the lights.

The first device to the fourth device in each aforementioned embodimentare described as the example like light and air condition, but thepeople with ordinary skill in the related art is able to replace theexample like light and air condition to other smart home appliancesaccording to the applications in practice, such as an electric stove, amicrowave oven, an ice machine or a monitor, etc. The feature is notlimited in this disclosure.

In some embodiments, based on the same concepts, it is able to capturethe sensing signal sensed by the temperature sensor and/or the humiditysensor of the air condition to the processor 1700. In other embodiments,a mobile device of a user is often connected to the LAN (local areanetwork) the same as the processor 1700, so the sensing signal of theambient light sensor of the mobile device is able to use as a lightsensor in specific area.

As a result, by integrating the audio sensor with the audio-videoplayer, it may be avoided to configure a plurality of the surplus audiosensors when the user sets the smart home appliances system.Furthermore, by integrating the audio sensor with the audio-videoplayer, it is able to be avoided to configure a plurality of the surplusaudio sensors when the user sets the smart home appliances system.

The embodiments depicted above and the appended drawings are exemplaryand are not intended to be exhaustive or to limit the scope of thepresent disclosure to the precise forms disclosed. Many modificationsand variations are possible in view of the above teachings.

What is claimed is:
 1. An audio-video control system adapted forcontrolling a first device in a first area and a second device in asecond area, the system comprising: a first audio-video player disposedin the first area and communicatively connected to the first device, thefirst audio-video player having a first audio sensor for receiving afirst audio signal in the first area; a second audio-video playerdisposed in the second area and communicatively connected to the seconddevice, the second audio-video player having a second audio sensor forreceiving a second audio signal in the second area; a routercommunicatively connected to the first audio-video player and the secondaudio-video player; and a processor communicatively connected to thefirst audio-video player and the second audio-video player through therouter, the processor adapted for processing the first audio signal togenerate a first control signal and for processing the second audiosignal to generate a second control signal, the processor sending thefirst control signal to the first device through the router and thefirst audio-video player, and the processor sending the second controlsignal to the second device through the router and the secondaudio-video player.
 2. The audio-video control system according to claim1, further comprising: a plurality of first sensors disposed in thefirst area, the first sensors adapted for collecting a plurality offirst sensing signals; wherein the processor generates the first controlsignal according to the first sensing signals and the first audiosignal.
 3. The audio-video control system according to claim 1, furthercomprising a storage medium storing a first instruction set associatedwith the first area, wherein the processor analyzes the first audiosignal to obtain the control signal according to the first instructionset when the processor processes the first audio signal.
 4. Theaudio-video control system according to claim 2, wherein the processorfurther obtains an audio playing signal from the first audio-videoplayer, and the processor performs a noise suppression process for thefirst audio signal according to the audio playing signal so as to obtaina third audio signal, and the processor processes the third audio signalfor obtaining the control signal.
 5. The audio-video control systemaccording to claim 4, wherein the processor further controls the firstaudio-video player to play an audio test signal in a test mode, and theprocessor controls the first audio sensor by a controller to receive anaudio correction signal, the processor obtains a time difference basedon the audio test signal and the audio correction signal, and theprocessor further performs the noise suppression process for the firstaudio signal based on the time difference when the processor performsthe noise suppression process for the first audio signal according tothe audio playing signal for obtaining the third audio signal.
 6. Theaudio-video control system according to claim 4, wherein the processorfurther controls the first audio-video player to play an audio testsignal in a test mode, and the processor controls the first audio sensorby a controller to receive an audio correction signal, the processorobtains a loudness correction model according to the audio test signaland the audio correction signal, and the processor further performs thenoise suppression process for the first audio signal based on theloudness correction model when the processor performs the noisesuppression process for the first audio signal according to the audioplaying signal for obtaining the third audio signal.
 7. The audio-videocontrol system according to claim 3, wherein the processor furtherobtains an audio playing signal from the first audio-video player, andthe processor performs a noise suppression process for the first audiosignal according to the audio playing signal so as to obtain a thirdaudio signal, and the processor processes the third audio signal forobtaining the control signal.
 8. The audio-video control systemaccording to claim 7, wherein the processor further controls the firstaudio-video player to play an audio test signal in a test mode, and theprocessor controls the first audio sensor by a controller to receive anaudio correction signal, the processor obtains a time difference basedon the audio test signal and the audio correction signal, and theprocessor further performs the noise suppression process for the firstaudio signal based on the time difference when the processor performsthe noise suppression process for the first audio signal according tothe audio playing signal for obtaining the third audio signal.
 9. Theaudio-video control system according to claim 7, wherein the processorfurther controls the first audio-video player to play an audio testsignal in a test mode, and the processor controls the first audio sensorby a controller to receive an audio correction signal, the processorobtains a loudness correction model according to the audio test signaland the audio correction signal, and the processor further performs thenoise suppression process for the first audio signal based on theloudness correction model when the processor performs the noisesuppression process for the first audio signal according to the audioplaying signal for obtaining the third audio signal.
 10. The audio-videocontrol system according to claim 1, wherein the processor furtherdetermines whether a third audio signal is corresponding to a secondaccount when the first audio sensor simultaneously receives the firstaudio signal and the third audio signal, and the processor onlyprocesses the first audio signal for obtaining the control signal whenthe third audio signal is not corresponding to the second account. 11.The audio-video control system according to claim 10, further comprisinga storage medium storing a first account instruction set and a secondaccount instruction set, wherein the first account instruction set isassociated with the first account, and the second account instructionset is associated with the second account, and the processor determineswhether the first account is prior to the second account so as todetermine which one of the first account instruction set and the secondaccount instruction set is used to generate the control signal when thethird audio signal is corresponding to the second account.
 12. Theaudio-video control system according to claim 1, wherein the processorobtains the control signal further according to a time information.